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Abstract 


have  an  efficient  optimization  technique  for  a class  of  problems  is 
to  have  no  more  than  a tool.  Like  any  tool,  it  can  be  used  well  or  poorly. 
This  paper  is  about  how  to  use  one  such  tool  for  distribution  planning  pro- 
blems (see  the  companion  piece  by  A.  Geoffrion,  G.  Graves  and  L.  Lee, 
"Strategic  Distribution  System  Planning:  A Status  Report,"  Working  Paper 
272a,  March  1978).  Discussion  centers  on  four  topics  of  importance  in 
practical  applications : the  relationship  between  system  cost  and  the 
number  of  distribution  facilities,  sensitivity  analysis,  robustness  analysis, 
and  implementation  priority  analysis.  Each  of  these  topics  requires  the  use 
of  optimization  in  ways  that  are  sometimes  less  than  obvious.  Several  illus- 
trations are  drawn  from  actual  applications  in  the  auto  parts,  consumer 
products,  food,  and  mining  industries. 
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Good  models  and  powerful  optimization  software  do  not  a successful 
project  make.  Among  the  additional  necessary  ingredients  is  a generous 
measure  of  imagination  and  cunning  regarding  ways  to  use  the  model  and 
software  in  support  of  ultimate  project  objectives.  We  are  thus  led  to 
ask  what  types  of  optimization  runs  should  be  done  and  how  the  results 
should  be  interpreted. 

A general  conceptual  discussion  of  this  issue  in  the  context  of 
distribution  system  planning  has  been  given  elsewhere.—^  Our  purpose 
here  is  to  give  a more  detailed  discussion  of  four  particular  types  of 
optimization  runs  which  users  often  either  overlook  or  have  conceptual 

i 

difficulty  with.  They  are: 

A.  System  Cost  versus  Number  analysis 

B.  Sensitivity  analysis 

C.  Robustness  analysis 

D.  Implementation  priority  analysis. 

There  are,  of  course,  many  other  important  types  of  optimization  runs  beyond 
these  four.  Liberal  use  is  made  of  numerical  examples  drawn  from  actual 
applications  in  the  auto  parts,  consumer  products,  food  and  mining 
industries. 

The  reader  is  presumed  to  have  a general  familiarity  with  distribution 
system  planning  models  of  the  type  described  in  Ref.  5. 


— ^See  Ref.  3,  and  also  Sec.  4.2  of  Ref.  1 and  pp.  27-29  of  Ref.  2. 
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A.  THE  SYSTEM  COST  VS.  NUMBER  CURVE 

The  Stj S fern  Cobt  . Number  c uAva  (CVN  for  short)  gives  the  minimum 
possible  total  system  cost  as  a function  of  the  number  of  open  distribution 
centers.  Figure  1 gives  an  example  of  such  a curve  for  the  packaged 
goods  division  of  a mining  company  (the  model  has  23  product  groups,  12 
plants,  51  candidate  distribution  center  locations,  and  110  customer  zones) . 
The  lowest  system  cost  in  this  example  is  obtained  with  a certain  system 
having  n*=30  facilities  open.  The  cost  implications  of  departing  from  this 
number  of  facilities  can  be  seen  at  a glance.  For  instance,  there  are  sys- 
tems with  as  few  as  22  and  as  many  as  39  facilities  that  come  within  1% 
costwise  of  the  best  30  facility  system. 


Minimum 

Total 

System 

Cost 

in 

Millions 

of 

Dollars 


4.45 

4.44 

4.43 

4.42 

4.41 

4.40 

4.39 


Number  of  Open  Distribution  Centers 

Figure  1 

The  System  Cost  Vs.  Number  (CvN)  Curve  For  a 
Mining  Company 
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Figure  1 and  all  similar  figures  are  drawn  for  convenience  as  though 
n,  the  number  of  open  facilities,  were  a continuous-valued  rather  than  in- 
teger-valued variable. 

It  bears  emphasis  that  this  kind  of  curve  cannot  be  obtained  by  the  very 
common  practice ^of  simply  costing  out  a number  of  plausible  system  config- 
urations having  different  numbers  of  open  facilities.  That  would  just 
produce  some  points  above  the  CvN  curve.  The  ability  to  perform  global 
cost  minimization  subject  to  a constraint  on  the  number  of  facilities  is 
essential  in  order  to  obtain  a true  CvN  curve,  which  can  be  viewed  as  the 
lower  envelope  of  a scatterplot  of  cost  against  size  for  all  feasible  sys- 
tem designs.  Of  course,  all  problem  data  must  be  held  constant  during  the 
generation  of  this  curve;  different  cost/ demand/service  scenarios  lead  to 
different  curves. 

There  are  at  least  three  reasons  why  the  CvN  curve  is  valuable . 

These  are  taken  up  in  turn. 

1 . First  Rationale:  The  CvN  Curve  is  of  Direct  Interest  to  Management 

The  number  of  open  distribution  centers  is  a distribution  system  de- 
sign parameter  of  great  intrinsic  interest  to  the  manager  and  system 
analyst.  Whether  a rebalanced  system  should  have  more  or  fewer  facilities 
than  the  present  system,  and  by  how  much  the  total  number  should  change,  is 
almost  always  a major  issue.  The  CvN  curve  reveals  how  large  a penalty  is 
associated  with  departures  up  or  down  from  the  ideal  number  n*  of  facilities. 
Such  departures  can  be  desirable  for  a variety  of  hard-to-quantify  reasons 
exterior  to  the  formal  model.  To  illustrate:  (i)  if  the  current  number  of 
facilities  is  substantially  different  from  n*,  management  can  make  use  of 


y 


e.g.t  p.  227  of  Ref.  6. 
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the  penalty  information  to  design  a conservative  initial  strategy  which  goes 
only  part  way  toward  n*;  (ii)  management  may  prefer  to  have  more  than  n* 
facilities  for  the  sake  of  improved  customer  service;  (iii)  management  may 
prefer  to  have  fewer  than  n*  facilities  because  the  model  has  omitted  certain 
economy-of-scale  cost  savings  that  were  difficult  to  estimate. 

2 . Second  Rationale:  Getting  at  the  Cost/Customer  Service  Tradeoff 

The  CvN  curve  facilitates  making  the  difficult  but  important 
tradeoff  between  system  cost  and  customer  service  level.  This  comes 
about  because  numerical  generation  of  a CvN  curve  produces,  as  an  inevi- 
table by-product,  some  complete  system  designs  that  attain  (or  nearly 
attain)  the  curve  at  various  points.  The  customer  service  character- 
istics of  these  system  designs  can  be  summarized  in  convenient  terms,  say 
the  percentages  of  demand  that  can  be  filled  within  one-day,  two-day,  and 
three-day  delivery.  Unless  there  is  some  reason  for  considering 
! systems  with  fewer  than  n*  distribution  centers,  just  the  system  designs 

with  n*  or  more  facilities  need  be  so  summarized  since  smaller  systems 
have  higher  cost  and  generally  worse  customer  service.  Thus  we  obtain  a 
series  of  alternative  systems  with  progressively  higher  cost  but  with 
progressively  better  customer  service.  Management  is  then  in  a position 
to  work  out  an  informed  compromise  between  the  conflicting  objectives  of 
minimizing  distribution  cost  and  maximizing  customer  service. 

The  tradeoff  possibilities  between  cost  and  customer  service  can  be 
pictured  graphically  if  the  latter  is  measured  by  an  index  number 
such  as  the  systemwide  average  (or  median  or  maximum)  distance  between 
customers  and  their  assigned  distribution  centers.  For  the  firm  whose 


CvN  curve  is  shown  in  Figure  1,  the  demand-weighted  average  distance 
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varies  as  shown  in  Figure  2.  The  cost/service  tradeoff  posibilities  are 
shown  most  clearly  if  Figures  1 and  2 are  combined  as  in  Figure  3.  This 
is  the  "cost-service  tradeoff  curve"  which  management  must  ponder. 


Demand-Weighted 
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Number  of  Open  Distribution  Centers 
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Figure  2 

How  A Proximity  Measure  of  Customer  Service  Varies  With 
The  Number  of  Open  Facilities  (for  a Mining  Company) 


Minimum 
System  Cost 
in  Millions 
of  Dollars 


Figure  3 

A Combination  of  Figures  1 and  2:  How  Minimum  System  Cost 
Varies  With  a Proximity  Measure  of  Customer  Service  (for 
a Mining  Company) 


3 . Third  Rationale:  A Device  for  Handling  Problematical  Costs  and 


Savings  That  Are  Dependent  on  the  Number  of  Open  Facilities 
Usually  there  are  significant  categories  of  cost  or  potential  savings 
which  depend  strongly  on  the  number  of  open  distribution  centers. 

Linear  costs  of  this  sort  can,  of  course,  be  built  into  the  individual 
facility  fixed  costs,  but  nonlinear  costs  of  this  type  cannot  be  and  hence 
cause  technical  difficulty.  Or  there  may  be  so  much  uncertainty  surrounding 
particular  kinds  of  costs  that  the  project  team  is  reluctant  to  build  them 
into  the  model  on  an  equal  footing  with  the  more  certain  cost  categories. 
This  difficulty  is  particularly  pronounced  when  contemplating  a signifi- 
cant departure  from  the  current  number  of  distribution  facilities.  As  we 
shall  see,  nonlinear  and  uncertain  costs  of  this  type  can  be  handled  with 
ease  once  the  CvN  curve  is  available. 

Here  is  a partial  list  of  cost  categories  that  can  be  modeled  as  func- 
tions of  the  number  of  open  distribution  centers: 

(a)  Central  Administrative  Costs.  It  takes  space,  =taff,  and 
equipment  at  company  headquarters  to  coordinate  and  manage 
a network  of  warehouses.  Central  administrative  costs  in- 
crease with  increasing  numbers  of  warehouses. 

(b)  Cycle  and  Safety  Stock  Carrying  Costs.  More  distribution 
centers  means  more  total  system  inventory,  but  the  rate  of 
increase  diminishes  with  the  number  of  facilities.  Con- 
ventional wisdom,  which  draws  some  support  from  classical 
inventory  theory,  has  it  that  cycle  and  safety  stoc  will 
increase  approximately  in  proportion  to  the  number  of 
facilities  raised  to  an  exponential  power  between  1/2  and  1. 

It  is  usually  possible  in  principle  to  build  inventory 
carrying  costs  directly  into  the  distribution  center 
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throughput  cost  functions,  but  when  numerous  inventory 
assumptions  or  policies  need  to  be  tested  (e.g.,  alterna- 
tive assumptions  on  the  cost  of  capital  or  policies  re- 
garding allowable  stockout  probability)  it  may  be  more 
expedient  to  model  these  costs  as  a function  of  the  total 
number  of  open  facilities. 

(c)  Consolidation  Stock  Carrying  Costs.  As  the  number  of  dis- 
tribution centers  increases  and  the  average  throughput 
decreases,  there  may  be  an  increase  in  what  might  be  called 
"consolidation"  stocks.  This  refers  to  the  need  to  accumu- 
late a sufficient  quantity  of  unfilled  customer  orders  at 

a distribution  center  so  that  a reasonably  economical  de- 
livery load  can  be  assembled.  The  aim  is  to  avoid  the  high 
costs  of  small  shipments. 

(d)  Customer  Order  Size  Effects.  Customers  which  are  close  to 
a warehouse  generally  tend  to  order  more  frequently  and  in 
smaller  quantities  than  customers  which  are  farther  away. 

This  implies  that  delivery  costs  tend  to  increase  on  a 
$/cwt-mi  basis  as  the  number  of  facilities  increases. 

(e)  Inter-warehouse  Transfer  Costs.  The  more  distribution  cen- 
ters there  are,  the  greater  the  coordination  problems  and 
hence  the  greater  the  tendency  to  transfer  inventory  between 
facilities  in  response  to  imbalanced  inventory  availability. 

( f ) Negotiated  Reductions  in  Warehousing  and  Delivery  Costs. 

The  smaller  the  number  of  distribution  centers,  the  greater 
their  individual  volume  and  hence  the  more  opportunity  there 


is  to  negotiate  more  favorable  arrangements  for  warehousing 
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and  delivery  services.  Many  modelers  are  reluctant  to  build 
such  sliding  cost  reductions  into  the  problem  data  because 
of  uncertainty  as  to  their  magnitude. 

Including  such  costs  in  the  model  tends  to  make  it  more  complete  and 
hence  a more  powerful  tool  for  problem-solving,  but  at  the  expense  of  making 
it  more  difficult  to  solve  (owing  to  nonlinearities)  and  of  rendering  it 
less  credible  (owing  to  cost/savings  uncertainties) . This  presents  a 
dilemma.  We  propose  to  deal  with  this  dilemma  by  leaving  such  costs  out  of 
the  model,  solving  it  parametrically  on  the  number  of  open  facilities,  and 

J 

then  factoring  in  the  omitted  costs  manually.  There  is  no  loss  of  global 
optimality  with  such  an  approach  so  long  as  the  omitted  costs  depend  just  on 
the  number  of  open  facilities,  and  the  manual  part  of  the  procedure  can  be 

tj 

repeated  easily  with  a variety  of  estimates  for  the  uncertain  costs. 

Suppose  that  the  total  dollar  amount  of  the  problematical  costs  is  esti- 
mated for  different  numbers  of  open  facilities.  Call  these  the  n-costs 

i 

for  want  of  a better  name,  and  make  sure  that  they  do  not  duplicate  any  cost 
components  comprising  system  costs  . The  system  design  which  minimizes  the 

overall  sum  of  system  costs  plus  n-costs  can  be  determined  by  simple 

J 

arithmetic  calculation  once  the  CvN  curve  is  in  hand.  Table  1 provides  an 
illustration.  Write  the  range  of  system  sizes  of  interest  in  the  first  col- 
umn, the  corresponding  CvN  figures  in  the  second,  and  the  corresponding  n-costs 
in  the  third.  Add  columns  2 and  3 to  obtain  column  4.  Scan  down 

column  4 to  find  the  smallest  overall  cost.  The  corresponding  system  itself 
may  already  be  known  as  a byproduct  of  having  generated  the  Cvn  curve;  other- 
wise (in  the  case  where  the  key  line  is  based  on  an  interpolation  of  the  CvN 
curve)  the  corresponding  system  can  usually  be  found  with  little  difficulty 
by  the  same  technique  used  to  generate  the  CvN  curve . 
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Number  of  Open 
Facilities 

System  Costs 

from 

CvN  Curve 

Estimated 
n- Costs 

Overal 1 

Costs 

24 

$4,402,500 

$ 80,000 

$4,482,500 

26 

4,390,500 

86,000 

4,476,500* 

28 

4,384,000 

93,500 

4,477,500 

30 

4,382,000 

100,000 

4,482,000 

32 

4,383,500 

106,500 

4,490,000 

34 

4,388,500 

113,500 

4,502 ,000 

36 

4,398,000 

120,000 

4,518,000 

least  over- 
all cost 


Table  1 


Sample  Calculation  of  the  Revised  Optimal  System  Design  When 
n-costs  are  Taken  into  Account  (only  every  2nd  line  , shown) 


It  should  be  obvious  that  nonlinearity  of  the  n-costs  offers  no 
difficulty.  Moreover,  alternative  assumptions  for  various  n-cost  com- 
ponents can  be  tried  out  easily  without  any  additional  computer  runs  (the 
CvN  curve  stays  the  same  — just  the  n-costs  vary)  . Thus  one  may  deal 
even  with  highly  speculative  costs  in  a convenient  series  of  manual  "what 
if..."  trials.  For  instance,  the  inventory  carrying  cost  component  can  be 
recalculated  for  several  different  stockout  probabilities  (say  10%,  5%,  and 
3%)  and  the  impact  on  system  design  can  thereby  be  assessed  for  this  im- 
portant measure  of  customer  service. 

The  CvN  curve  is  a remarkably  powerful  tool  when  used  in  thi  fashion. 
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4 . Theoretical  Curves 

The  general  shape  of  a System  Cost  vs.  Number  curve  can  be  derived 
analytically  using  calculus  under  highly  simplified  assumptions.  Such  an 
exercise  is  worthwhile  for  several  reasons.  First,  the  theoretical  curve 
indicates  sensible  ways  for  interpolating  between  available  points  on  the 
actual  CvN  curve  and  for  extrapolating  beyond  them.  Second,  having  the 
theoretical  curve  in  advance  facilitates  the  generation  procedure  described 
in  the  Appendix.  But  most  importantly,  a general  agreement  between  the 
theoretical  curve  and  the  real  one  suggests  why  the  CvN  curve  looks  the  way 
it  does:  the  key  cost  tradeoffs,  between  decreasing  outbound  costs  and  in- 
creasing fixed  costs  as  the  number  of  open  distribution  centers  increases, 
which  underly  the  theoretical  curve  are  probably  similar  for  the  full  scale 
model . 

It  is  a straightforward  matter  to  derive  a formula  for  the  CvN  curve 
for  the  simplified  analytical  model  developed  in  the  Appendix.  The  curve 
itself  depends  on  the  problem  data  in  a fairly  complicated  way,  but  one  can 
obtain  a good  idea  of  the  general  shape  of  the  curve  by  (i)  excluding  those 
cost  components  which  do  not  depend  on  n in  the  best  n-facility  system,  and 
(ii)  normalizing  the  curve  to  give  relative  system  cost  versus  relative 
member  of  open  facilities.  The  effect  of  (i)  is  to  exclude  variable  faci- 
lity costs  and  inbound  transportation  costs  to  the  centroid  of  each  open 
facility's  service  area.  The  remaining  "non-excluded"  costs  can  be  thought 
of,  loosely  speaking,  as  fixed  and  outbound  costs.  "Relative"  means  that 
costs  and  the  number  of  open  facilities  are  expressed  as  a fraction  of 
their  optimal  values.  Under  (i)  and  (ii) , a single  CvN  curve  covers  all 
numerical  cases-^. 

— ^See  the  Appendix  for  a derivation.  With  the  help  of  (3a) -(3d)  in  the 

Appendix  and  the  curve  shown  in  Figure  4,  it  is  a simple  matter  to  construct 
the  full  CvN  curve  with  no  cost  components  excluded. 
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The  result  is  given  in  Figure  4,  where  TC*(n)  denotes  the  total  non- 
excluded  costs  for  the  best  n-facility  system.  The  optimal  number  of  open 
facilities  is  denoted  by  n*.  Figure  4 shows  the  relative 

total  cost  TC* (n)/TC* (n*)  as  a function  of  n/n*.  Thus  if  one  contemplates 
using  the  least  cost  system  design  having  half  the  ideal  number  of  facili- 
ties (n/n*  = 0.5),  fixed  plus  outbound  costs  are  predicted  to  be  11% 
higher  than  for  the  least  cost  system  design  having  n*  facilities  (since 
TC* (n*/2)/TC* (n*)  = 1.11). 

How  closely  does  this  curve  approximate  real  CvN  curves?  Often 
fairly  well.  The  little  triangles  in  Figure  4 correspond  to  points  on 
the  real  CvN  curve  for  the  mining  company  model  mentioned  earlier.  The  fit 
is  quite  decent.  It  would  seem  the  counterplay  between  fixed  and  outbound 
costs  in  the  simplified  analytical  model  may  bear  a resemblance  to  the  situ- 
ation for  the  full  mixed  integer  linear  programming  model,  where  the  thous- 
ands of  individual  data  elements  tend  to  obscure  what  is  really  going  on. 

Not  only  can  theoretical  CvN  curves  be  derived,  but  proximity-based 
customer  service  relationships  like  the  one  in  Figure  2 can  also  be  derived 
under  simplifying  assumptions.  This  is  worthwhile  for  reasons  similar  to 
those  mentioned  earlier. 

It  is  easy  to  show  that  in  the  case  of  equal  hexagonal  service  areas 
covering  uniformly  distributed  demand,  the  median  (or  average  or  maximum) 
distance  between  customers  and  their  assigned  distribution  centers  must 


A = points  from  the  mining 
company  1 s CvN  curve 


Relative  Number  of  Open  Distribution  Centers 


Figure  4 

Normalized  Theoretical  Cvn  Curve 
(derived  in  the  Appendix) 
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vary  as  one  over  the  square  root  of  the  number  of  facilities Figure  5 
shows  this  relationship  for  the  continental  U.S.  (omitting  33%  of  the  land 
area  as  too  sparsely  populated  to  require  coverage) . The  striking  feature 
of  this  curve  is  that  the  marginal  advantage  of  additional  distribution 
centers  becomes  trifling  once  there  are  more  than  15  or  20  of  them. 

It  is  interesting  to  combine  Figures  4 and  5 together  in  the  same  way 
Figures  1 and  2 were  combined.  This  is  done  in  Figure  6,  which  shows  the 
theoretical  tradeoff  between  a proximity  measure  of  customer  service  and 
system  cost  (more  precisely,  the  fixed  plus  outbound  freight  component  of 
system  cost) . Several  curves  are  drawn  because  the  tradeoff  depends  on 
the  optimal  number  n*  of  facilities,  which  in  turn  depends  on  problem  data. 
The  numbers  written  adjacent  to  each  curve  indicate  how  many  facilities 
correspond  to  selected  points  on  the  curve . 

A conspicuous  feature  of  these  curves  is  t.nat  the  cost  penalty 
necessary  to  obtain  a significant  improvement  in  customer  service  quickly 
becomes  prohibitive  for  values  of  n greater  than  n*  by  more  than  about  20% . 


Let  n equal  hexagons,  each  centered  on  a distribution  center,  cover  an 
area  of  A square  miles.  The  median  distanc^  from  the  center  of  each 
hexagon  is  that  number  r which  satisfies  irr  =0.5  A/n,  that  is,  the 
area  within  r miles  of  the  center  of  each  hexagon  equals  half  of  the 

hexagon's  total  area.  Thus  r = /a/2tt  //n  . 


Assigned  Distribution  Center  (miles) 


Theoretical  Relationship  Between 
a Proximity  Measure  of  Customer 
Service  and  the  Number  of  Open 
Facilities  (for  the  continental 
United  States) 


Number  of  Open  Distribution  Centers 


curve  corresponds  to  a different  value  of 
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(?lxed  + Outbound  Freight)  Varies  With  a Proximity  Measure  of  Customer  Service 
[annotations  indicate  the  number  of  open  distribution  centers] 
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B.  SENSITIVITY  ANALYSIS 

Empirical  AZnACtivA-ty  anatiji-ii  is  based  on  the  idea  of  setting 
up  a family  of  runs  for  which  selected  components  of  the  data  are  al- 
tered by  a formula  that  depends  on  a single  adjustable  parameter.  To 
carry  out  a series  of  sensitivity  analysis  runs,  one  optimizes  for  each 
of  a series  of  settings  of  the  parameter.  Often  the  parameter  is  just  a 
scale  factor  for  certain  model  data,  say  outbound  costs.  Other  times 
the  parameter  enters  additively  or  in  more  complicated  ways. 

There  are  numerous  situations  where  sensitivity  analysis  is  in  order. 
One  is  when  the  modeler  is  undertain  of  the  estimates  for  certain  types 
of  model  data.  Sensitivity  analysis  on  these  data  may  show  that  there  is 
no  significant  dependence  of  the  model  solution  over  the  range  of  uncer- 
tainty; or,  in  the  event  of  significant  dependence,  knowledge  of  the 
extent  of  dependence  may  help  the  modeler  decide  how  much  additional 
effort  to  allocate  to  the  estimation  task.  Another  situation  calling  for 
sensitivity  analysis  occurs  when  the  analyst  feels  that  certain  data  will 
change  over  time  in  a certain  way.  It  is  usually  straightforward  to  de- 
termine how  the  model  solution  would  change  in  response.  A third  use, 
perhaps  the  most  important  of  all,  is  to  help  develop  insights  into  the 
workings  of  the  distribution  system  — a tool  with  which  to  find  out  the 
"whys"  behind  the  "whats"  in  each  stack  of  optimization  output. 


! 


The  variety  of  particular  types  of  sensitivity  analysis  is  poten- 
tially enormous.  Discussion  will  be  limited  to  systematic  variations  in: 


i 


* 
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. inbound  freight  rates 
. outbound  freight  rates 
. distribution  center  fixed  costs 
. distribution  center  variable  costs 
. demands 

Each  of  these  categories  involves  multiple  data  elements,  sometimes 
more  than  a thousand.  Usually  one  thinks  in  terms  of  parametric 
changes  applied  to  selected  populations  of  numbers  rather  than  to  indi- 
vidual numbers  themselves.  It  therefore  seems  natural  to  take  into  account 
how  any  proposed  parametric  change  will  alter  the  two  most  fundamental 
descriptive  measures  of  each  population:  its  mean  and  its  standard  de- 
viation. This  distinction  turns  out  to  be  particularly  appropriate  be- 
cause the  influence  of  one  of  these  measures,  the  mean,  allows  a consid- 
erable amount  of  theoretical  understanding  whereas  the  other  most  defin- 
itely does  not  (at  least  at  the  present  time) . 

Now  ordinary  multiplicative  scaling,  while  of  appealing  simplicity 
and  sometimes  having  a plausible  rationale , has  the  unfortunate  property 
that  it  factors  both  the  mean  and  the  standard  deviation  of  the  population 
so  scaled.  One  may  avoid  this  confounding  effect  by  parameterizing  in 
some  other  way  which  changes  the  mean  or  the  standard  deviation,  but  not 
both  at  the  same  time . 

The  proper  type  of  parameterization  for  studying  the  influence  of  the 
mean  (we'll  call  this  the  mean  effect)  is  simple  additive  change.  Just 
add  a constant  to  all  of  the  data  elements  to  be  changed.  The  mean  is 


correspondingly  changed  but  the  standard  deviation  is  not,  nor  is  any 
other  shape  parameter  of  the  distribution. 
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The  proper  type  of  parameterization  for  studying  the  influence  of  the 
standard  deviation  (we'll  call  this  the  dispersion  effect)  is  not  quite 
so  obvious,  but  a good  choice  is  summarized  as  follows. 

Proposition  Let  the  numbers  { x, ,x„,...,x  } have  mean  u and 

* 12  n 

standard  deviation  o . Define 

y^  = y + fHx^-V)  , i = 1,  — ,n 

where  0 > 0.  Then  the  numbers  { y„ ,y_ ,... ,y  } have  mean  u 
— 1 2 n 

and  standard  deviation  {to  . 

This  simple  parameterization  factors  the  standard  deviation  but  leaves 
the  mean  unchanged. 

Empirical  sensitivity  results  can  be  obtained  for  these  two  types 
of  parameterization  for  any  other  type)  by  brute  force,  that  is, 
by  optimizing  for  each  of  a series  of  parameter  settings.  A high  degree  of 
solution  regularity  is  often  observed  when  this  is  done,  particularly 
if  the  mean  effect  is  dominating.  This  suggests  that  idealized  relation- 
ships might  be  derived  to  guide  the  interpolation  and  extrapolation  of 
the  empirical  results.  Since  each  point  on  a sensitivity  curve  requires 
a full  optimization  run,  success  along  these  lines  could  be  of  considerable 
practical  utility.  Moreover,  the  analytical  derivations  themselves  of 
the  theoretical  sensitivity  curves  are  likely  to  deepen  our  understanding 
of  why  the  curves  have  the  general  shape  that  they  do. 

It  is  possible,  under  highly  simplified  assumptions,  to  derive  an 
analytical  expression  for  the  optimal  number  of  facilities.  Such  an  ex- 
pression permits  sensitivity  analysis  predictions  if  one  hypothesizes  that 
the  proportional  change  in  the  number  of  facilities  in  the  full  scale  model 
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will  be  the  same  as  the  proportional  change  in  the  number  of  facilities  in 
the  simplified  analytic  model. 

A useful  analytic  minimodel  for  one  plant  and  one  product  is  explained 

in  the  Appendix,  where  it  is  demonstrated  that  the  following  formula  is  a 

good  approximation  to  the  optimal  number  of  warehouses  covering  a total 
2 

area  of  A mi  at  minimum  total  cost: 

n*  = .33  A (pt/f)2/3  (1  - (r/t)2)1/2. 

The  symbols  are  defined  as  follows: 

2 

p uniform  demand  density,  in  CWT/mi  yr 
f fixed  cost  of  an  open  warehouse,  in  $/yr 
r inbound  freight  rate,  in  $/CWT  mi 
t outbound  freight  rate,  in  $/CWT  mi 


For  this  formula  it  is  easy  to  predict  the  mean  effect  of  A,  p,  r,  t, 
or  f.  For  instance,  changing  f to  f changes  n*  by  the  factor 

= .33  A (pt/f)2/3  (l  - (r/t)  2)1/2  = ~ 2/3 

n*  from  the  .33  A (pt/f)2/3  (l  - (r/t  2)1/2 

minimodel 


We  therefore  hypothesize 


number  of  warehouses  if  ? is  used 
number  of  warehouses  if  f is  used 


(f/f) 


2/3 


from  the 
full  scale 
model 


In  a similar  manner,  we  can  develop  the  following  hypothesis  for  the  in- 


fluence of  the  mean  effect  on  inbound  freight  rates: 

number  of  warehouses  if  9r  is  used i 
number  of  warehouses  if  r is  used 


from  the 
full  scale 
model 


1/2 
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These  two  formulas  ere  graphed  in  Figure  7 along  with  the  formulas 
for  the  influence  of  the  mean  effect  for  demand  density  and  outbound 
freight  rate.  The  "parameter  change  factor"  is  the  ratio  of  new  to  old 
value  for  the  parameter  of  interest;  in  the  notation  used  above,  for 
instance,  the  factor  was  f/f  in  the  case  of  fixed  cost  and  9 in  the  case  of 
inbound  freight  rate.  The  "n*  change  factor"  is  just  the  ratio  of  the  new 
value  of  n*  to  the  old  value  of  n* . Thus  the  sensitivity  curve  for  f in- 
dicates that  20%  fewer  warehouses  are  needed  when  fixed  costs  all  increase 
by  40%. 

Notice  that  there  is  just  one  sensitivity  curve  for  f and  p,  but  the 
curves  for  r and  t depend  on  the  ratio  r/t.  In  practice  this  ratio  is 
usually  well  under  1/2.  Curves  are  given  for  ratio  values  of  0,  .2,  .3, 

.4  and  .5.  The  p curve  coincides  with  the  t curve  for  r/t  = 0. 

The  analyst  should  compare  such  predictions  with  the  empirical  results 
of  sensitivity  analysis.  General  agreement  seems  to  occur  more  often  than 
not,  but  disagreement  can  also  be  valuable  because  it  forces  an  inquiry 
into  the  reasons  for  the  discrepancy. 


J 
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First  Example 

A series  of  runs  for  the  mining  company  model  was  done  in  which 
different  constants  were  added  to  each  of  the  facility  fixed  costs.  This 
is  exactly  the  additive  change  parameterization  recommended  previously 
for  studying  the  mean  effect.  In  Table  2 we  compare  the  actual  change  in 
n*  with  the  theoretical  prediction 


n* 

30 


15,800 


mean  fixed  cost 


2/3 


22  - 


The  prediction  is  quite  good  except  for  very  large  downward  changes  in 
fixed  costs,  where  boundary  effects  (there  are  only  51  candidate  sites 
available)  and  the  lower  warehouse  throughput  limits  offer  plausible  ex- 
planations for  the  discrepancy. 


Run  # 

Mean  Fixed  Cost 

Actual  n* 

Predicted 

1 

$36,600 

19 

17.1 

2 

25,000 

23 

22.1 

3 

20,000 

27 

25.6 

4 

15,800 

30 

30.0 

5 

13,500 

33 

33.2 

6 

10,000 

38 

40.7 

7 

5,000 

40 

51 

TABLE  2 


Comparison  of  Actual  and  Predicted  Changes  in  n*  Owing  to  Changes 
in  the  Mean  Fixed  Cost  (Data  for  a Mining  Company) 


Second  Example 

The  model  for  a consumer  products  manufacturer  has  a single  plant,  one 
product  group,  about  65  candidate  facility  locations,  and  about  130  cus- 
tomer zones.  An  issue  of  interest  was  the  sensitivity  of  the  full  scale 
model  to  changes  in  the  inbound  freight  rates. 


With  an  r/t  ratio  of  0.082,  the  analytic  minimodel  predicts  a very  low 
degree  of  sensitivity.  It  predicts,  for  instance,  that  a 50%  reduction  in 
inbound  freight  rates  will  increase  the  number  of  warehouses  by  only 
1/4  of  1%. 

This  prediction  was  tested  by  subtractively  reducing  inbound  freight 
rates  by  an  average  of  50%  and  then  rerunning  the  full  scale  model . The 
result  was  that  the  optimal  number  of  warehouses  increased  from  24  to  25, 
which  is  in  quite  reasonable  agreement  with  the  prediction. 

It  warrants  emphasis  that  both  the  prediction  and  the  empirical  test 
pertain  to  the  mean  effect  and  not  to  the  dispersion  effect.  We  made  sub- 
tractive rather  than  multiplicative  changes  when  setting  up  the  new  run  of 
the  full  scale  model,  but  simply  halving  all  inbound  freight  rates  — 
which,  as  we  know  from  earlier  discussion,  involves  both  mean  and  disper- 
sion effects  — increases  the  optimal  number  of  warehouses  by  4 to  28.  Thus 
the  dispersion  effect  has  an  influence  not  at  all  well  predicted  by  the 
minimodel . 

This  illustration  was  adapted  from  Ref.  4,  wherein  a somewhat  more 
elaborate  extension  of  the  minimodel  is  suggested  which  does  appear  to  give 
useful  predictions  of  dispersion  effects  as  well  as  mean  effects. 

In  closing,  it  bears  emphasis  that  minimodels  should  be  thought  of  as 
adjuncts  to,  and  not  as  replacements  for,  full  scale  models.  The  most  one 
can  hope  for  is  that  they  capture  the  essential  features  of  the  problem 
well  enough  to  exhibit  the  general  character  of  the  solution  in  a quali- 
tatively correct  way  and  to  reveal  the  basic  forces  responsible  for  the 
character  of  the  solution.  An  effective  minimodel  can  enhance  the  useful- 
ness of  a full  scale  model  in  much  the  same  way  that  a small  sighting  scope 


enhances  the  usefulness  of  a large  telescope. 
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C.  ROBUSTNESS  ANALYSIS  AND  REGRET 

Before  recommending  a reconfigured  distribution  system  it  is  wise  to 
study  its  robustness  in  the  face  of  various  departures  from  the  assump- 
tions on  which  it  is  based.  How  will  the  proposed  system  fare  five  years 
from  now  when  the  demand  pattern  has  changed?  When  a backhaul  revenue 
development  program  reaches  maturity?  How  would  the  proposed  system  fare 
if  outbound  freight  rates  increase  twice  as  fast  as  inbound  freight  rates? 
If  that  perennial  possibility  of  acquiring  a new  plant  on  the  West  Coast 
should  materialize? 

Such  questions,  and  many  others  like  them,  challenge  the  validity  of 
any  proposed  redesign  of  a distribution  system.  Sometimes  they  can  be 
disposed  of  by  logical  argument  or  manual  analysis  without  the  need  to 
perform  additional  optimization  runs.  But  often  the  only  way  to  address 
them  properly  is  by  some  type  of  robustness  analysis  performed  on  the 
full  scale  model. 

The  typical  robustness  question  is  of  the  following  type: 

"PxopoAed  intern  A u>  but  ion.  model  data  icenanlo  X; 
would  intern  A Atill  be  neanly  but  Aome  otiien  data 
icenanlo  V should  matenlaJLize ?" 

A typical  naive  approach  to  this  question  is  to  optimize  using  data 
scenario  Y to  find  the  best  system  B in  that  case.  Systems  A and  B are 
then  compared;  if  the  differences  look  fairly  "small",  then  system  A is 
declared  to  be  "robust"  with  respect  to  Y.  Another  common  naive  approach 
is  to  price  out  (no  optimization)  system  A using  scenario  Y;  if  the  cost 
doesn't  change  much,  system  A is  declared  to  be  robust. 
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Both  of  these  approaches  can  be  misleading.  The  trouble  with 
them  is  that  they  do  not  measure  the  regret  that  would  be  felt  if  system 
A were  implemented  but  then  scenario  Y actually  materializes.  Regret 
in  this  eventuality  is  the  difference  between  the  cost  that  would  be  in- 
curred by  system  A under  scenario  Y and  the  cost  that  would  occur  under 
scenario  Y if  system  B (the  best  one  under  scenario  Y)  were  in  place. 

Two  runs  are  necessary  to  answer  the  robustness  question  properly 


in  terms  of  cost  regret: 

1st  Run:  Find  the  best  system  B under  scenario  Y;  let  its 
total  system  cost  be  TC*(b]y). 

2nd  Run:  Lock  in  those  aspects  of  system  A's  design  which 
could  not  be  changed  easily  in  the  event  that  A 
were  implemented  but  scenario  Y actually  occurs ; 


optimize  over  the  remaining  aspects  of  system 
design  using  the  data  of  scenario  Y.  Let  TC*(a|y) 
be  the  resulting  total  system  cost. 

Regret  = TC* (a| Y) -TC* (B | Y) , or 
Relative  Regret  = Regret/TC* (B | Y) 

Notice  that  some  judgement  is  necessary  in  the  second  run.  Typically, 
since  facility  locations  are  expensive  or  organizationally  upsetting  to 
change,  these  locations  would  be  locked  up  whereas  customer  zone  assign- 
ments and  transportation  flows  would  not  be  (they  would  be  optimized 


during  the  second  run) . 


The  trouble  with  the  first  naive  approach  mentioned  is  that  it  com- 
pares systems  A and  B but  not  their  costs.  It  is  quite  possible  for  A 
and  B to  look  quite  similar  and  yet  be  far  apart  cost-wise,  or  for  A and 
B to  look  quite  dissimilar  and  yet  be  very  close  cost-wise  (i.e.,  for 
the  regret  to  be  small) . The  trouble  with  the  second  naive  approach  is 
that  it  provides  no  basis  for  judgement;  who  can  say  whether  TC*(a]y)  is 
"good"  unless  TC*(b|y)  is  known? 

If  regret  turns  out  to  be  small,  fine.  But  what  if  it  turns  out  to 
be  unacceptable?  It  will  then  be  necessary  to  think  about  how  and  why 
systems  A and  B differ,  and  to  attempt  a compromise  design  which  performs 
acceptably  well  under  both  scenarios  X and  Y.  Much  depends,  of  course, 
on  the  estimated  liklihood  that  scenario  Y will  materialize.  The  concept 
of  regret  continues  to  play  a fundamental  role. 

Example 

The  model  for  a major  auto  parts  manufacturer  had  9 product  groups, 

3 plants,  35  candidate  distribution  center  locations,  and  144  customer 
zones.  Improved  system  designs  were  developed  under  two  major  options, 
namely  whether  or  not  there  will  be  a highly  automated  national  "super- 
depot". Among  the  robustness  studies  deemed  advisable,  it  was  decided  to 
study  the  influence  of  a 25%  increase  in  all  transportation  rates  on 
Recommended  System  A (no  superdepot) . 

A first  optimization  run  was  done  with  all  transportation  rates  duly 

factored  by  1.25,  and  with  all  distribution  centers  free  to  open  or  close 

except  for  the  candidate  superdepots.  The  result  is  schematized  below, 

where  "1"  stands  for  an  open  facility  and  "0"  stands  for  a closed  facility 

Recommended  System  A 111100011111111100000000000000000000000 
Optimal  System,  1st  Run  000011111111111100000000000000000000000 
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Evidently  the  increased  transportation  costs  induce  very  substantial  changes 
in  the  least  cost  system  configuration.  It  would  appear  that  Recommended 
System  A is  not  robust  in  the  face  of  increased  transporation  costs. 

A second  run  was  then  done  with  the  same  cost  structure  but  with  all 
facility  locations  fixed  per  Recommended  System  A.  Thus  the  only  optimi- 
zation performed  in  this  run  was  the  rebalancing  of  all  transportation  flows 
(especially  the  facility  service  areas)  in  response  to  the  increased  trans- 
portation costs.  The  resulting  total  system  cost  was,  surprisingly,  just  a 
shade  less  than  1%  higher  than  the  total  system  cost  under  the  first  run. 
Thus  Recommended  System  A was  shown  to  be  reasonably  robust  after  all  in 
the  face  of  increased  transportation  costs! 

This  example  illustrates  the  pitfall  of  the  first  naive  approach  to 
robustness  analysis  mentioned  earlier.  The  solution  itself  changed  substan- 
tially when  transportation  rates  went  up  25%,  but  in  terms  of  monetary 
regret  this  change  was  relatively  insignificant. 


1 


A 


D.  IMPLEMENTATION  PRIORITY  ANALYSIS 


Among  the  last  runs  to  be  done  are  those  which  attempt  to  provide 
guidance  to  management  in  setting  implementation  priorities  for  the 
various  system  changes  which  have  emerged  as  desirable.  Toward  this  end 
it  is  appropriate  to  study  the  impact  of  these  changes  individually  and 
in  combination  with  one  another.  Generally  speaking,  those  which  prove 
to  be  responsible  for  greater  improvements  in  overall  system  performance 
should  be  put  into  practice  before  those  which  yield  lesser  improvements. 
Some  changes  may  even  prove  to  be  of  such  small  value  that  their  implement- 
ation will  be  postponed  indefinitely  as  not  worth  the  organizational  upset, 
or  at  least  held  in  abeyance  pending  restudy  with  updated  data  at  some 
future  date.  Underlying  the  need  for  such  implementation  priority  studies 
are  the  sobering  realities  of  what  is  possible  in  the  near  term  in  terms 
of  capital  expenditure  approval,  politics,  organizational  change  and 
customer  acceptance . 

It  is  important  to  understand  that  some  departures  from  any  given 
system  configuration  are  usually  possible  for  a negligible  cost  penalty. 

For  instance,  the  substitution  of  Macon  for  Savannah  as  the  site  for  a 
Southeastern  distribution  center  (DC)  may  well  have  a very  small  impact  on 
cost  and  customer  service.  The  combinatorial  richness  of  DC  location 
patterns,  customer  zone  assignment  possibilities,  and  transportation  flows 
virtually  guarantee  that  the  particular  system  configuration  to  come  out 
of  any  given  run  is  just  one  of  numerous  alternatives  with  nearly  equal 
cost.  The  aim  of  implementation  priority  analysis  is  to  exploit  this  fact 
by  searching  among  these  alternatives  for  ones  that  are  superior  with  re- 
spect to  such  hard-to-quantify  criteria  as  those  mentioned  at  the  end 


of  the  previous  paragraph. 
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These  remarks  apply  with  particular  force  to  the  important  case  of 
evaluating  proposed  changes  in  the  number  and  location  of  DC's.  Since  the 
focus  is  on  how  the  current  locational  configuration  will  be  changed,  it 
is  usually  wise  to  represent  the  proposed  locational  configuration  as  the 
current  configuration  plus  specific  "changes”,  where  each  change  is  one  of 
three  types : 

(a)  an  isolated  closing  of  a current  DC 

(b)  an  isolated  opening  of  a new  DC 

(c)  the  "moving"  of  a current  DC  to  a new  location. 

Usually  such  a representation  is  evident  with  the  help  of  a map,  although 
it  may  not  be  unique.  As  a hypothetical  example: 

Change 

1.  Close  Columbus  (current) 

2.  Close  Boston  (current) 

3.  Open  Miami  (new) 

4.  Move  San  Francisco  (current)  to  Los  Angeles  (new) 

The  impact  of  any  specific  change  or  combination  of  changes  cam  be 
evaluated  by  doing  a run  in  which  the  current  configuration  is  modified 
accordingly.  All  candidate  locations  would  be  locked  open  or  locked 
closed—^,  but  optimization  would  still  take  place  with  respect  to  customer 
zone  assignments,  DC  sizes,  transportation  flows,  and  plant  loadings. 
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None  would  be  free  unless  there  are  discrete  size  alternatives  for  a 
DC  at  a site  that  is  to  be  used,  and  it  is  desired  to  allow  automatic 
selection  of  the  best  size  alternative. 


Thus  if  we  compare  a base  case  run  locking  in  the  current  locational  con- 
figuration with  another  run  locking  in  the  configuration  modified  to  close 
Columbus,  say,  the  consequent  difference  in  total  system  costs  is  a true 
and  comprehensive  evaluation  of  the  cost  impact  of  closing  Columbus,  It  is 
not  just  a first  approximation  of  the  estimated  savings  such  as  one  could 
get  by  costing  out  a plausible  manual  reassignment  of  Columbus'  customers 
to  other  nearby  DC's;  rather,  it  takes  full  account  of  all  ripple  and 
interaction  effects  by  making  exhaustive  use  of  the  adaptive  capabilities 
of  the  system  (i.e.,  by  optimally  adjusting  customer  assignments,  DC 
sizes,  transportation  flows,  and  plant  loadings).  Four  runs  beyond  the 
base  case  run  will  reveal  the  cost  impact  of  each  of  the  four  hypothetical 
changes  taken  individually,  and  a few  more  runs  should  suffice  to  es- 
tablish the  appropriate  implementation  priority. 

Example 

The  model  for  a food  company  had  17  products,  14  plants,  45  candidate 
distribution  center  locations,  and  121  customer  zones  (refer  to  Refs.  1 
and  3 for  further  details) . Implementation  priority  analysis  focused 
mainly  on  the  impact  of  distribution  center  location  changes  on  total 
system  cost,  since  this  was  the  primary  management  concern  and  the  deter- 
minant of  capital  requirements  and  organizational  dislocations.  The  most 
economical  reconfiguration  required  six  changes  to  the  current  locational 
configuration.  These  six  changes,  which  will  not  be  detailed  here,  were 
examined  individually  and  in  various  combinations. 

The  results  of  the  priority  analysis  applied  to  the  six  changes  are 
summarized  in  Table  3.  The  first  row  indicates  that  the  least  cost  system 
has  a total  normalized  cost  of  100.  Run  A shows  that  the  current  system 
at  the  time  of  the  study  had  a total  cost  of  103.15.  From  run  B it  can  be 


seen  that  the  total  cost  associated  with  the  current  DC  locations  could  be 
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reduced  by  1.72  by  optimizing  the  DC  service  areas  and  transportation 
flows.  Runs  B.1-B.6  show  the  consequence  of  implementing  each  of  the  DC 
location  changes  individually.  Changes  3 and  6 appear  to  be  quite 
attractive,  changes  2 and  5 only  moderately  attractive,  and  changes  1 
and  4 are  unattractive  (presumably  because  they  require  the  simultaneous 
presence  of  another  change  to  be  worthwhile) . A further  consideration 
was  that  change  5 would  give  additional  warehouse  space  in  a geographical 
region  where  it  was  particularly  needed.  Thus  changes  3,  5 and  6 were 
quite  attractive  at  this  stage  of  the  analysis. 

Another  way  of  looking  at  the  economic  value  of  the  more  doubtful 
changes  1,  2 and  4 is  to  omit  (rather  than  include)  them  one  at  a time. 
This  was  done  in  runs  B.7-B.9,  which  selected  the  best  subset  of  the  five 
changes  remaining  after  each  dubious  change  was  omitted.  This  required 
placing  appropriate  candidate  locations  in  a "free"  rather  than  locked 
status  in  each  run.  The  results  of  these  runs  tend  to  support  the  adop- 
tion of  changes  3,  5 and  6 on  a first  priority  basis.  Run  C shows,  in 
fact,  that  changes  3,  5 and  6 together  save  a little  more  than  would  be 
expected  by  adding  up  their  one-at-a-time  savings  ( .29t.06+. 72=1. 07  < 1.13) . 

With  changes  3,  5 and  6 decided  upon,  the  other  changes  were  again 
examined  individually.  Runs  C.1-C.3  reveal  that  change  1 is  still  border- 
line, while  2 and  4 now  look  worthwhile.  This  conclusion  is  further 
supported  by  runs  B.7-B.9,  because  it  turns  out  that  the  results  of  these 
runs  would  have  been  the  same  if  changes  3,  5 and  6 had  been  mandatory. 

In  light  of  this  and  of  other  considerations  outside  the  scope  of  the 
model,  second  priority  was  given  to  changes  2 and  4 and  change  1 was 
dropped  from  further  serious  consideration  (run  D shows  that  omitting 
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change  1 incurs  an  economic  penalty  of  only  1/100  of  1%)  . 
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APPENDIX 

AN  ANALYTICAL  MINIMODEL 

It  is  possible  under  very  highly  simplified  assumptions  to  write  down 
a "minimodel"  that  can  be  solved  analytically  for  just  about  any  result  of 
interest  — the  optimal  number  of  warehouses,  the  annual  level  of  each 
cost  category,  the  sensitivity  with  respect  to  any  data  coefficient,  and 
so  on.  The  potential  advantages  of  using  minimodels  in  conjunction  with 
full  scale  mathematical  programming  models  were  discussed  along  with 
several  examples  in  Ref.  4. 

This  Appendix  presents  a minimodel  that  combines  the  first  two  out  of 
the  three  minimodels  given  in  Ref.  4.  We  have  found  it  a useful  adjunct  to 
a number  of  different  strategic  planning  models  and  have  therefore  selected 
it  as  the  basis  for  several  results  stated  in  the  main  text. 

The  Problem 

Consider  the  case  where  a single  plant  with  fixed  location  ships  a 
standard  mix  of  products  to  a number  of  warehouses  which  in  turn  ship  to 
customers.  Make  the  following  assumptions: 

(a)  Demand  is  uniformly  distributed  on  the  plane  with  a 

2 

density  of  p cwt/mi—  yr. 

(b)  The  inbound  freight  rate  from  the  plant  to  each  warehouse 
is  r $/cwt-mi . 

(c)  The  outbound  freight  rate  from  each  warehouse  is 
t $/cwt-mi . 

(d)  The  fixed  cost  for  each  open  warehouse  is  f $/yr. 
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(e)  The  variable  throughput  cost  of  each  warehouse  is 
v $/cwt. 

(f)  Candidate  warehouses  are  all  identical  with  respect  to 
cost  characteristics,  have  no  throughput  limits,  and 
are  located  densely  in  the  plane. 

The  objective  is  to  select  which  of  the  candidate  warehouses  to  open 
so  as  to  cover  a total  area  of  A square  miles  with  non-overlapping  equal- 
sized square  service  areas  at  minimal  total  cost. 

The  choice  of  square  service  areas  is  a matter  o'"  convenience  as  it 
can  be  shown,  somewhat  surprisingly,  that  requiring  circular  or  hexagonal 
service  areas  would  yield  the  same  results  for  all  practical  purposes. 
Assumption  (f)  and  this  insensitivity  to  the  exact  shape  of  the  service 
areas  imply  that,  practically  speaking,  revising  assumption  (a)  to  distri- 
bute demand  uniformly  over  the  continental  U.S.  rather  than  over  the  plane 
would  yield  the  same  results  provided  at  least  a dozen  or  so  warehouses  are 
used. 

Statement  of  Main  Results 

Let  n be  the  number  of  open  warehouses.  A good  approximation  to  the 
value  of  n which  minimizes  total  annual  cost  is 

(1)  n*  = .332A  (pt/f)2/3  (r/t) 2 

so  long  as  0 < r/t  <_  1/2  and  n*  ^10.  The  ratio  r/t  almost  always  falls 
toward  the  low  end  of  this  range  in  practical  applications  because  inbound 
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transportation  tends  to  involve  much  larger  shipments  than  outbound 
transportation—^ . 

Fractional  values  of  n*  may  occur  in  (1) , and  to  b*  strictly  correct  one 
would  then  have  to  compute  total  cost  for  the  two  integers  on  either  side 
of  n*  and  choose  the  better  of  the  two.  But  for  the  range  of  applicability 
of  (1),  where  n*  >_  10,  it  seems  safe  to  speak  informally  of  fractional  n* . 

The  n*  warehouses  should  be  arranged  about  the  plant  in  such  a manner 
that  the  union  of  their  (equal-sized)  square  service  areas  is,  loosely 
speaking,  a tightly-packed  figure  that  resembles  as  Glosely  as  possible  a 
circle  centered  on  the  plant  with  area  A square  miles.  A more  precise 
prescription  for  optimal  arrangement  of  the  n*  warehouses  will  be  given  in 
the  course  of  the  proof.  Some  examples  are  given  in  Figure  A for  selected 
values  of  n. 


n=10  n=19  n=25 


Figure  A 

Examples  of  Optimal  Service  Area  Arrangement  Relative  to  the 
Plant  (small  circle)  for  Different  Numbers  of  Warehouses  (n) 


i^or  example,  a recent  issue  of  InduAtAy  Week  (20  February  1978,  p.  80)  gives 
average  cent/ton  mile  figures  of  7.0  for  truck  and  1.4  for  rail,  which  would 
yield  an  r/t  of  0.2  if  all  inbound  freight  goes  by  rail  and  outbound  by  truck 
Our  practical  experience  indicates  that  r/t  is  usually  closer  to  0.1  than  to 
0.2,  owing  to  less-than-truckload  shipments  and  shorter  distances  traveled 
on  the  outbound  side. 
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The  warehouses  are  not  located  exactly  at  the  center  of  their  respec- 
tive service  areas,  but  rather  are  displaced  somewhat  in  the  direction  of 
the  plant.  The  optimal  amount  of  displacement  is  approximately 

(2)  .572  ^ j /A/n*  miles. 

Notice  that  /A/n*  can  be  interpreted  as  the  length  of  each  side  of  a 
service  area. 

The  components  of  total  annual  cost  associated  with  the  best  possible 
n-warehouse  system  (n  need  not  equal  n*)  are  as  follows: 


(3a) 

nf 

fixed  warehouse  cost 

(3b) 

pAv 

variable  warehouse  cost 

(3c) 

pAr  (.373 

- .572 

f ^ ) 

inbound  cost 

2 

L. 

- 

2 n 

3/4  ? 

(3d) 

PA  /A/n 

{.572  | . 

. 3826t 

1 - i 

ft  | H 

1 

j outbound  cost. 

The  corresponding  system  may  not  have  the  optimal  number  of  warehouses, 
but  it  is  configured  optimally  relative  to  the  plant  given  that  there  must 
be  n facilities.  Each  facility  is  displaced  optimally  from  the  center  of 
its  service  area;  the  amount  of  displacement  is  given  by  (2)  with  n*  re- 
placed by  n.  Observe  that  (3c)  and  (3d)  each  contain  a term  that  is 
identical  except  for  the  sign.  This  term  accounts  for  the  cost  influence 
of  warehouse  displacement  away  from  the  center  of  the  service  areas  in  the 
direction  of  the  plant. 

From  (3)  it  is  straightforward  to  obtain  the  associated  System  Cost 
vs.  Number  curve.  Leaving  aside  for  the  sake  of  simplicity  those  costs 
which  do  not  depend  on  n (i.e.,  variable  warehouse  costs  and  the  first 
component  of  inbound  cost)  and  denoting  the  sum  of  the  remaining  costs  as 
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TC*(n),  the  normalized  curve  has  the  following  formula: 


(4) 


TC*  (n)  = 1 / n_ \ 2 ( n_ . 1/2 

TC*(n*)  3 y n*  / 3 y n*  J 


This  relation  is  graphed  in  Figure  4.  (It  would  be  easy  to  develop  a normc-i- 
ized  curve  for  which  TC*(n)  includes  all  cost  comptnents,  but  then  the  final 


result  would  not  be  independent  of  the  problem  data.) 
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The  only  real  task  is  to  justify  (3c)  and  (3d) . Relation  (1)  is 
obtained  simply  by  setting  the  first  derivative  with  respect  to  n of  (3a)  + 
(3b)  + (3c)  + (3d)  equal  to  zero  (total  cost  as  a function  of  n is  convex, 
as  follows  from  the  positivity  of  the  second  derivative  for  all  n >_  0)  . 
Relation  (2)  and  the  optimal  arrangement  of  warehouses  about  the  plant  will 
follow  from  the  analysis  leading  to  (3c)  and  (3d).  Relations  (3a)  and  (3b) 
are  immediate.  Relation  (4)  can  be  demonstrated  as  follows: 


TC* (n) 


TC* (n*) 


TC*  (n*  . n/n*)/n* 
TC* (n*)/n* 


(n/n*)f  + .3826pA3/2 

t 

[>  - (if] 

3/4 

(n/n*)~  1/2 (n*)"  3/2 

f + .3826PA3/2 

t 

r-  < 

»21 

3/2 

(n*r  3/2 

j 

(n/n*)f  + 2(n/n*)~  1/2 f 


f + 2f 


L (n\  + l (n\ 

3 \n*J  3 \n*  J 


- 1/2 


3/2 


factors  in  order  to  obtain  the 


where  (1)  was  used  to  eliminate  the  (n*) 
third  equality. 

An  essential  step  toward  justifying  (3c)  and  (3d)  is  to  be  able  to 
determine  the  optimal  location  of  a single  warehouse  when  the  locations  of 
its  service  area  and  the  plant  are  both  given.  The  situation  is  depicted 
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in  Figure  B,  where  the  plant  is  P miles  from  the  center  of  an  A/n  square 
mile  service  area  and  the  warehouse  is  displaced  D miles  in  the  direction 
of  the  plant. 


Service 

Area 


Figure  B 

A Plant,  A Warehouse  and  It's  Service  Area 
The  inbound  freight  cost  for  the  situation  of  Figure  B is 


(5)  p - (P  - D)  r 

n 

and  the  outbound  freight  cost  is 


(6) 


p | E(D 


/A/n) t 


where  E(D;  /A/n)  is  the  average  distance  between  the  warehouse  and  the 

customers  in  its  service  area  (remember  that  demand  is  uniformly  distri- 
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buted) . An  analytical  expression  for  the  function  E(  - ; • ) is  known—  . 
This  makes  it  possible  to  deal  with  the  problem  of  finding  that  value  of 
D which  minimizes  the  sum  of  (5)  and  (6) . The  problem  can  be  written 
after  elementary  manipulation  as 
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—See  expression  (8.9)  on  p.  156  of  S.  Eilon,  C.D.T.  Watson-Gandy , and 
N.  Christofides , Distribution  Management,  Hafner  Publishing  Company, 
New  York,  1971.  A partial  tabulation  appears  on  p.  157  . 


Clearly,  the  optimal  solution  depends  only  on  the  quantities  r/t  and 
*'A/n  , and  the  second  of  these  can  be  absorbed  into  the  variable  of 

minimization.  Straightforward  numerical  calculations  yield  this  simple  but 
accurate  approximation  to  the  optimal  value  of  D for  0 r/t  <_  1/2 : 


D*  a .572  (r/t)  /h/rx  . 


The  error  is  less  than  2%,  and  approaches  this  level  only  for  r/t  toward 
the  upper  end  of  its  range.  The  same  numerical  calculations  also  yield 
the  following  approximation  to  the  minimal  value  of  the  quantity  in  braces 


in  (7) 


.3826  [j.  - (r/t)2  J 3/4  . 


The  error  is  less  than  1%  for  0 <_  r/t  <_  1/2 . 

Relations  (8)  and  (9)  completely  solve  the  problem  of  finding  the 
displacement  D in  Figure  B which  minimizes  inbound  plus  outbound  freight 
costs  over  the  practical  range  0 <_  r/t  <_  1/2:  the  optimal  displacement  is 
given  by  (8) , the  associated  inbound  freight  cost  is 

(5*)  p - r ( P - .572  £ ) , 

n \ t ' 


and  the  associated  outbound  freight  cost  is 


<S)  ( 


/ “ r~  2 

! .572  £ + . 3826t  [_  1 - (r/t) 
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This  proves  (2) . It  also  proves  (3d) , which  is  just  n times  (6*) . Simi- 
larly, the  second  part  of  (5*)  proves  the  second  part  of  (3c) . 

It  remains  to  demonstrate  the  first  part  of  (3c) , namely 


P A r (.373  Jh  ) 


A 
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and  also  to  show  how  the  warehouse  service  areas  should  be  arranged  with 
respect  to  the  plant.  These  two  issues  are,  in  fact,  inextricably 
interrelated.  Notice  that  P depends  on  the  placement  of  the  service  area 


with  respect  to  the  plant.  Let  P* . denote  this  distance  for  the  jth 

in 


service  area  under  the  assumption  that  n service  areas  are  arranged  so  as 
to  minimize  the  sum  of  their  distances  from  the  plant.  Then  the  first  part 
of  (5*)  yields 

A n 

(ID  P - r 1 P*.  , 

n j=l  -*n 

which  differs  from  (10)  and  hence  requires  reconciliation.  That  is, 


we  would  like  to  show  that 


_ A 1 j • 

(12)  P*(n,A)  = — £ P*  approximately  equals  .373  ‘A 

n j=l  3n 


The  quantity  P*  (n,A)  is  interpreted  as  the  minimum  average  distance  in 
miles  between  the  plant  and  the  centers  of  n square  service  areas  covering 
a total  of  A square  miles,  over  all  possible  non-overlapping  arrangements 
of  the  service  areas. 

It  is  necessary  to  study  how  P*  varies  with  n and  A.  Behavior  with 
respect  to  A is  simple:  from  elementary  dimensional  considerations, 

P*  (n,A)  = P*  (n,l)  A 


(13) 


(just  exploit  the  arbitrariness  of  the  mile  as  a unit  of  length) . Behavior 
with  respect  to  n is  more  complicated.  Figure  A shows  service  area 
arrangements  that  appear  to  be  optimal  for  n equal  to  10,  19  and  25.  It 
is  straightforward  to  verify  that  P*  (n,A)  equals  .3686  A,  .3698  A 
and  .3748  A , respectively,  for  these  values  of  n.  The  geometry  of 
possible  arrangements  of  n service  areas  is  such  that  P*  (n,l)  does  not  be- 
have smoothly  as  a function  of  n,  but  there  is  considerable  regularity  and 
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convergence  does  occur  to  the  asymptotic  value  (2/3)/  = .37613  as 

n -*  » . The  reason  for  this  particular  value  is  that  it  equals  the  average 
distance  from  the  center  of  a disk  of  unit  area  to  a uniform  distribution 
of  points  on  it  (as  n becomes  large,  the  centers  of  the  optimally  placed 
service  areas  approach  a uniform  circular  distribution  for  the  obvious 
reason) . 

Extensive  hand  calculations  show  that  the  convergence  of  P*  (n,l)  to 
the  asymptotic  value  is  quite  rapid,  following  the  approximate  relationship 

(14)  P*  (n,l)  s .376  (1  - 1/7.25  n)  for  n 10  . 

This  estimate  is  good  to  within  1%  or  so.  Evidently,  there  is  a "small  n 
effect"  which  works  to  diminish  inbound  freight  costs  below  their  asymptotic 
level,  this  effect  diminishing  rapidly  as  n increases  (e.g.,  the  cost  (11) 
is  2%  below  its  asymptotic  value  for  n = 10,  1%  below  for  n = 20,  .5%  below 
for  n = 40) . 

The  desired  relation  (12)  is  now  at  hand  from  (13)  and  (14)  if  we  take 
n = 18  in  (14)  as  a plausible  working  approximation.  It  can  be  shown  that 
this  working  approximation  introduces  a virtually  negligible  error  in  the 
various  results  which  it  affects.  This  completes  the  verification  of  rela- 
tions (1)  - (4)  and  the  rationalization  of  service  area  arrangement 
relative  to  the  plant. 


